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Abstract. Understanding the formation of the first stars and galaxies is 
a key problem in modern cosmology. In these lecture notes, we will derive 
some of the basic physical principles underlying this emerging field. We 
will consider the basic cosmological context, the cooling and chemistry 
in primordial gas, the physics of gravitational instability, and the main 
properties of the first stars. We will conclude with a discussion of the 
\mJ \ observational signature of the first sources of light, to be probed with 

r~| ■ future telescopes, such as the James Webb Space Telescope. 

O ! !• Introduction 

^: 

^ , The first sources of light fundamentally transformed the early universe, from the 

simple initial state of the cosmic dark ages into one of ever-increasing complexity 
(for comprehensive reviews, see Barkana & Loeb 2001; Bromm et al. 2009; Loeb 
2010; Whalen et al. 2010). This process began with the formation of the first 
^L. I stars, the so-called Population III (Pop III), at redshifts z ~ 20 — 30. These 

(SJ ' stars are predicted to form in dark matter minihalos, comprising total masses of 

00 , ~ 10^ M0. Current models suggest that Pop III stars were typically massive, or 

^^ ' even very massive, with M^, ~ 10 — lOOM©; these models also predict that the 

Cn . first stars formed in small groups, including binaries or higher-order multiples. 

^^ ' First star formation has been reviewed in Bromm & Larson (2004) and Glover 

(2005). There, you will find a survey of the key papers up to 2005. For the most 
recent developments, I recommend to consult the specialized literature (e.g., 
Turk et al. 2009; Stacy et al. 2010; Clark et al. 2011, Greif et al. 2011). 
^ I Once the first stars had formed, feedback processes began to modify the 

\—i • surrounding intergalactic medium (IGM). It is useful to classify them into 3 

categories (see Ciardi & Ferrara 2005): radiative, mechanical, and chemical. The 
first feedback consists of the hydrogen-ionizing photons emitted by Pop III stars, 
as well as the less energetic, molecule-dissociating radiation in the Lyman- Werner 
(LW) bands. Once the first stars die, after their short life of a few million years, 
they will explode as a supernova (SN), or directly collapse into massive black 
holes (MBHs). In the SN case, mechanical and chemical feedback come into play. 
The SN blastwave exerts a direct, possibly very disruptive, feedback on its host 
system, whereas the chemical feedback acts in a more indirect way, as follows: 
The first stars, forming out of metal-free, primordial gas, are predicted to be 
characterized by a top-heavy initial mass function (IMF). Once the gas had been 
enriched to a threshold level, termed "critical metallicity" (Zcrit), the mode of star 
formation would revert to a more normal IMF, which is dominated by lower mass 
stars (see Frebel et al. 2007). Chemical feedback refers to this transition in star 
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formation mode, implying that less massive stars have a less disruptive impact 
on their surroundings. The complex physics of pre-galactic metal enrichment, 
and the nucleosynthesis in Pop III SNe, are reviewed in Karlsson et al. (2012). 

The first, Pop III stars are thus predicted to form in small groups, in mini- 
halos with shallow gravitational potential wells. As they were typically very 
massive, they would quickly exert a strong negative feedback on their host sys- 
tems. Numerical simulations indicate that this feedback completely destroys the 
host, in the sense of heating and evacuating all remaining gas. There would 
therefore be no opportunity for a second burst of star formation in a minihalo. 
Furthermore, since all (most?) Pop III stars are massive enough to quickly die, 
there would be no long-lived system of low-mass stars left behind. The Pop III 
forming minihalos, therefore, are not galaxies, if a bona-fide galaxy is meant to 
imply a long-lived stellar system, embedded in a dark matter halo. The question: 
What is a galaxy, and, more specifically, what is a first galaxy?, however, is a 
matter of ongoing debate (see Bromm et al. 2009). And as we have seen, this 
question is intimately tied up with the feedback from the first stars, which in 
turn is governed by the Pop III IMF (top-heavy or normal). 

Theorists are currently exploring the hypothesis that "atomic cooling halos" 
are viable hosts for the true first galaxies (Oh & Haiman 2002). These halos have 
deeper potential wells, compared to the minihalos mentioned above; indeed, they 
have 'virial temperatures' of Tyjr ~ 10^ K, enabling the primordial gas to cool 
via efficient line emission from atomic hydrogen. We will further clarify these 
concepts below. It is useful to keep in mind that observers and theorists often 
employ different definitions. As a theorist, you wish to identify the first, i.e., 
lowest-mass, dark matter halos that satisfy the conditions for a galaxy. Ob- 
servers, on the other hand, usually aim at detecting truly metal-free, primordial 
systems. Recent simulation results, however, suggest that such metal-free galax- 
ies do not exist. The reason being that rapid SN enrichment from Pop III stars, 
formed in the galaxy's minihalo progenitors, provided a bedrock of heavy ele- 
ments. Any second generation stars would then already belong to Population II 
(Pop II). These questions, and the problem of first galaxy formation in general, 
have been reviewed in Bromm & Yoshida (2011), where the reader can again 
find pointers to the detailed literature. For a comprehensive overview of galaxy 
formation and evolution in general, including the situation at lower redshifts, 
< z < 5, see the monograph by Mo et al. (2010). 

The first star and galaxy field is just entering a dynamic phase of rapid 
discovery. This development is primarily driven by new technology, on the theory 
side by ever more powerful supercomputers, reaching peta-scale machines, and on 
the observational side by next-generation telescopes and facilities. Among them 
are the James Webb Space Telescope (JWST), planned for launch in ~ 2018, and 
the suite of extremely large, ground-based telescopes, such as the GMT, TMT, 
and E-ELT. The capabilities of the JWST are summarized in Gardner et al. 
(2006), as well as in the monograph by Stiavelli (2009). Complementary to them 
are ongoing and future meter-wavelength radio arrays, designed to detect the 
redshifted 21cm radiation from the neutral hydrogen in the early universe (see 
the review by Furlanetto et al. 2006). A further intriguing window into the epoch 
of the first stars is provided by high-redshift gamma-ray bursts (GRBs). These 
are extremely bright, relativistic explosions, triggered when a rapidly rotating 
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massive star is collapsing into a black hole (see the monograph by Bloom 2011). 
The first stars are promising GRB progenitors, thus possibly enabling what has 
been termed "GRB cosmology" (for details see Kouveliotou et al. 2012). 

There is a second approach to study the ancient past, nicely complementary 
to the in siiw observation of high-redshift sources. This alternative channel, often 
termed "Near-Field Cosmology" (Freeman & Bland-Hawthorn 2002), is provided 
by local fossils that have survived since early cosmic times. Among them are 
extremely metal-poor stars found in the halo of the Milky Way. The idea here 
is to scrutinize their chemical abundance patterns and derive constraints on the 
properties of the first SNe, and, indirectly, of the Pop III progenitor stars, such 
as their mass and rate of rotation (for reviews, see Beers & Christlieb 2005; 
Frebel 2010). Another class of relic objects is made up of the newly discovered 
extremely faint dwarf galaxies in the Local Group. These ultra-faint dwarf (UFD) 
galaxies consist of only a few hundred stars, and reside in very low-mass dark 
matter halos. Their chemical and structural history is therefore much simpler 
than what is encountered in massive, mature galaxies, and it should be much 
more straightforward to make the connection with the primordial building blocks 
(e.g., Salvadori & Ferrara 2009). 

The plan for these lecture notes is to provide a theoretical 'toolkit', con- 
taining the basic physical principles that are the foundation to understand the 
end of the cosmic dark ages. We will consider the overall cosmological context, 
the fundamentals of star formation as applicable to the primordial universe, the 
properties of the first stars, and the physical principles underlying the assem- 
bly of the first galaxies. We conclude with some useful tools of observational 
cosmology, allowing us to connect theory with empirical probes. For a detailed 
discussion of the phenomenology and the numerical simulations, we refer the 
reader to the review papers and monographs cited above. Again, the goal here is 
to focus on the basic framework, which will likely be relevant for many years to 
come, enabling the student to follow the rapid progress unfolding in the research 
literature. I invite you to join in on this grand journey of discovery! 

2. The Cosmological Context 

2.1. CDM Structure Formation 

We now have a very successful model that describes the expansion history of the 
universe, and the early growth of density fluctuations (Komatsu et al. 2011). 
This is the A cold dark matter (ACDM) model, as calibrated to very high pre- 
cision by the Wilkinson Microwave Anisotropy Probe (WMAP). Structure for- 
mation in this model proceeds hierarchically, in a bottom-up fashion, such that 
small objects emerge first, and subsequently grow through mergers with neigh- 
boring objects and the smooth accretion of matter. To characterize the resulting 
distribution of density fiuctuations, we measure the "overdensity" in a spherical 
window of radius R and total (gas -|- dark matter) mass M, where M = Air/SpR'^: 

Sm^'-^. (1) 

P 

Here, p is the mass density within a given window, and p that of the background 
universe at the time the overdensity is measured. Next, the idea is to place 
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the window at random everywhere in the universe, and to calculate the (mean- 
square) average: cj^(M) = {Sf^), where the brackets indicate a spatial average. 
The latter is closely related to the ensemble average, where one considers many 
realizations of the underlying random process that generated the density fluctu- 
ations in the very early universe (ergodic theorem). Here and in the following, 
all spatial scales are physical, as opposed to comoving, unless noted otherwise. 

Due to gravity, the density perturbations grow in time. This growth is 
described with a "growth factor", D{z), such that: 



a oc D(z) oc a 



l + z 



(2) 



The second proportionality is only approximate, and would be strictly valid in 
a simple Einstein-de Sitter background model. The expression for the growth 
factor is more complicated in a A-dominated universe (see Loeb 2010), but the 
Einstein-de Sitter scaling still gives a rough idea for what is going on at z ^> 1. 
Indeed, it is quite useful for quick back-of-the-envelope estimates. Early on, all 
fluctuations are very small, with 6m ^ 1; but at some point in time, a given 
overdensity will grow to order unity. One says that a fluctuation is in its linear 
stage, as long as 5m < 1, and becomes "non- linear" when 6m > 1- Formally, a 
critical overdensity of Sc = 1-69 is often used to characterize the transition. The 
behavior and evolution of the perturbations in their linear stage can be treated 
analytically, e.g., by decomposing a density field into Fourier modes. Once the 
fluctuation turns non-linear, one needs to resort to numerical simulations to 
further follow them to increasingly high densities. 

A basic tenet of modern cosmology is that the quantum-mechanical pro- 
cesses that imprinted the density fluctuations in the very early universe left 
behind a (near-) Gaussian random field. The probability that an overdensity has 
a given value, around a narrow range cISm is then: 



PiSM)d6_ 



M 



2-Kalj 



: exp 



^l 



2a 



M 



dSM ■ 



(3) 



One speaks of a 'V-sigma peak", when: 5m = '^O'M- Note that high-sigma 
peaks are increasingly unlikely, and therefore rare. One also says that such 
peaks are highly biased, and one can show that such peaks are strongly clustered 
(see, e.g.. Mo et al. 2010). The sites for the formation of the first stars and 
galaxies are predicted to correspond to such high-sigma peaks. To predict the 
redshift of collapse, or "virialization" redshift (see below), we demand: 5m (-2) = 
D{z)5m{z = 0) ~ (5c, or, using equ. (2) 



v(Jm{z = 0) 



1.69 , 



such that: 1 -|- Zvir — v(Tm{z = 0)/1.69, where (Jm{z = 0) is the rms density 
fluctuation, extrapolated to the present. On the scale of a minihalo (M ~ 
10^ Mq), one has: (Tm(-z = 0) ~ 10. For collapse (virialization) to occur at, say, 
Zvir — 20, we would then need v ~ 3.5. Thus, the first star forming sites are rare, 
but not yet so unlikely to render them completely irrelevant for cosmic history. 
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2.2. Virialization of DM Halos 

Once a given perturbation becomes non-linear {Sm ^ 1)) the corresponding 
dark matter (DM) collapses in on itself through a process of violent dynamical 
relaxation. The rapidly changing gravitational potential, d<p/dt, acts to scatter 
the DM particles, and their ordered motion is converted into random motion. 
The result of this "virialization" is a, roughly spherical, halo, where the kinetic 
and gravitational potential energies approach virial equilibrium: 2£'kin ~ — E'pot- 
Note that the total energy, Etot = -E'kin + -E'pot = — ^kin; is negative, which implies 
that the halo is bound. 

It is now convenient to define the gravitational potential (potential energy 
per unit mass), as follows: 

_ -E^pot ^ CMh 

Here, Mh is the halo mass (gas + DM), which is connected to the halo density 
and radius, often called "virial" density and radius, via 

Mh ^ yPviriivir • (5) 

The virial density, established after the virialization process is complete, is related 
to the background density of the universe at the time of collapse, at z^^j.: p^\j. ~ 
200p(2;vir)- In terms of the present-day background density, one has: 

p{z) = (1 + zfp{z = 0) = 2.5 X 10"^° gcm-3 (i + zf . (6) 

A very useful concept to gauge how the baryonic (gaseous) component will 
behave when falling into the DM halos mentioned above is the "virial temper- 
ature". The idea is to ask what would happen to a proton, of mass mn = 
1.67 X 10^^'^ g, when it is thrown into such a DM potential well. Through com- 
pressional heating, either adiabatically or involving shocks, the particle would 
acquire a random kinetic energy of: 

fcE^vir ^ ekin ^ -epot ^ — ^ , (7) 

where /cb is Boltzmann's constant. Combining the equations above yields 

where the normalizations are appropriate for a first-galaxy sytem, or, technically, 
and atomic cooling halo. For a minihalo, where M^ ~ W^Mq and z^h ~ 20, one 
has: Tvir ~ 1,000K. A related quantity is the halo binding energy 

Eh ^ Etot ^ ^ W^ ergs —^^^ — , (9) 

where the normalizations are again appropriate for an atomic cooling halo. For a 
minihalo, the corresponding number is: E^ ^ 10^" ergs. Comparing these values 
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Figure 1. Cooling rate of primordial gas as a function of temperature. 
Shown is the contribution from atomic hydrogen and helium {solid line), 
as well as that from molecular hydrogen ( dashed line) . Atomic hydrogen 
line cooling is very efficient at temperatures of T > 10^ K, whereas at 
lower temperatures, cooling has to rely on H2, which is a poor coolant. 
This is the regime of the minihalos, hosting the formation of the first 
stars. Adopted from Loeb (2010). 



with the explosion energy of Pop III SNe, where ii^sN — 10^^ — 10^'^ ergs, one 
gets the zero-order prediction that minihalos may already be severely affected by 
SN feedback, evacuating most of the gas from the DM halo (see Ciardi & Ferrara 
2005). The more massive atomic cooling halos, on the other hand, are expected 
to survive such negative SN feedback. This expectation is roughly born out by 
numerical simulations (see Bromm &; Yoshida 2011). 



2.3. Gas Dissipation 

To form something interesting, such as stars, black holes, or galaxies, gas needs 
to be able to collapse to high densities. Initially, such collapse is triggered by 
the DM potential well in halos, as the DM is dynamically dominant, and the 
gas (the baryons) just follow along. However, different from DM, the gas is 
collisional, and therefore subject to compressional heating. If this heat could 



First Stars and Galaxies 



not be radiated away, or dissipated, there would eventually be sufficient pressure 
support to stop the collapse. The key question then is: Can the gas sufficiently 
cool? A simple, but intuitively appealing and useful, answer is provided by the 
cassical Rees-Ostriker-Silk criterion, that the cooling time has to be shorter than 
the free-fall time: tcool < ^flf- If this criterion is fulfilled, a gas cloud will be 
able to condense to high densities, and possibly undergo gravitational runaway 
collapse. These important timescales are defined as follows: tg ~ 1/y/Gp and 
^cool — nk^T/A, where A is the cooling function (in units of ergscm^^ s^-*^). 

In Figure 1, the cooling function for primordial, pure H/He, gas is shown. 
One can clearly distinguish two distinct cooling channels, one at T > 10*^ K, 
where cooling relies on atomic hydrogen lines, and one at lower temperatures, 
where the much less efficient H2 molecule is the only available coolant. In the 
present-day interstellar medium (ISM), metal species would dominate cooling in 
this low-temperature regime, but, by definition, they are absent in the primordial 
universe. The first cooling channel governs the formation of the first galaxies 
(atomic cooling halos), since Tvir '^ 10^ K for Mh ~ W^Mq and Zvir ~ 10. 
First star formation in minihalos, on the other hand, is governed by the low- 
temperature, H2, cooling channel. The reason is again that minihalos typically 
have Tvir ~ 1,000K. 



2.4. Halo Angular Momentum 

Another important ingredient for early star and galaxy formation is angular mo- 
mentum. Current cosmological models posit that the post-recombination uni- 
verse, at z < 1, 000, was free of any circulation {V xv = 0). Angular momentum 
is thought to have been created through tidal torques during the collapse of 
slightly asymmetric overdensities. The idea is that neighboring perturbations 
exert a net torque on a given halo, thus spinning it up. It is convenient to pa- 
rameterize the resulting total angular momentum, J, of a virialized halo by a 
"spin parameter": 

GM^/^ \\Etot\J 



where l-Etotl is again the total halo (binding) energy, and Ej-ot ~ J^/(Mhi?yjj,) the 
total rotation energy. Numerical simulations, studying the large-scale evolution 
of the DM component, have shown that the spin parameter is distributed in a 
log-normal fashion with a mean of A ~ 0.04 (see Mo et al. 2010). 

For the DM component, the spin parameter is conserved during collisionless 
evolution. During the dissipational collapse of the gas, however, the spin pa- 
rameter can change. In particular, the system can be driven towards centrifugal 
support, where Ej-d ~ |ii^tot|- What is the radius of centrifugal support? As- 
sume that the baryons collapse further, in a fixed DM halo potential. It is then 
straightforward to show that the baryons have to collapse by a factor of ~ A~^ 
to reach centrifugal support: i?ccnt — Ai?vir- This then is the typical dimension 
of any large-scale disk that forms inside a dark matter halo. 
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3. First Star Formation: Basic Principles 

Primordial star formation shares many similarities with the present-day case, in 
terms of basic principles. It is therefore always a good idea to seek guidance from 
the rich phenomenology and understanding reached in classical star formation 
theory (see McKee & Ostriker 2007; Zinnecker & Yorke 2007; Bodenheimer 2011 
for reviews). In the following, in the spirit of our basic toolkit approach, we will 
address gravitational (Jeans) instability, the physics of protostellar accretion, 
and the properties of a generic stellar IMF. 

3.1. Gravitational Instability 

Consider a gaseous cloud of linear size L with a given mass density p and tem- 
perature T. Such a cloud will be unstable to gravitational runaway collapse, if: 
^sound > %) with tsound — L/cg being the sound-crossing time. The sound speed 
is Cs — y/k^T /mn oc T^'"^ . The intuition here is that the free-fall time measures 
the strength of gravity, in the sense that a smaller tg corresponds to a stronger 
force of gravity. Similarly, the sound-crossing timescale provides a measure for 
the strength of the opposing thermal pressure, where again a smaller tgound in- 
dicates stronger pressure forces. The above timescale criterion for gravitational 
instability can be written as: 

LI 

Cs VGp 
This inequality defines the Jeans length 

L > Lj ~ -^ , (12) 



with the interpretation that a density perturbation has to exceed a certain critical 
size, such that gravitational forces take over, and cannot be balanced by thermal 
pressure any longer. One then defines the Jeans mass, as follows 

/ J. x3/2, ^ x-l/2 

A/j~pL3-500Afe(^ (lol^^j ■ <13) 

where n ~ p/mn is the hydrogen number density, and the normalizations reflect 
typical values in Pop III star forming regions. A closely related concept is the 
Bonnor-Ehert mass, where: Mj ~ 2 x Mbe- 

3.2. Protostellar Accretion 

Every star, regardless of whether we are dealing with Pop I or Pop III, is as- 
sembled in an inside-out fashion, such that a small hydrostatic protostellar core 
is formed first at the center of a Jeans-unstable cloud. This initial core subse- 
quently grows through accretion. Feedback effects from the growing protostar 
will eventually terminate this process, thus setting the final mass scale of the 
star. The initial core mass is small, close to the so-called opacity lim,it for frag- 
mentation: Mp ~ 1O~^M0. This lower limit to the mass of a star can be derived 
by considering a cloud that is roughly in free-fall collapse. Collapse can proceed 
as long as the gas is able to radiate away the concomitant compressional heat. At 
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very high densities, however, the gas becomes opaque to this coohng radiation. 
Soon thereafter thermal pressure forces can stop the cohapse, and a hydrostatic 
core is born. The key question now is to address the growth via accretion, which 
happens in two distinct regimes, first as predominantly spherical accretion, and 
then as disk-dominated accretion. 

Spherical accretion For a roughly spherical gas cloud with a mass close to the 
Jeans mass, one can estimate the average accretion rate as follows: 

iff p V^ 

In the first step, we have assumed that gravity cannot move any material faster 
than the free-fall time. Using Cg oc T'^'^, it is sometimes convenient to rewrite this 
as: M ~ cf/G. Typically, one has for Pop I: T ~ lOK ^ M ~ 10'^ MqJT-^; 
and for Pop III: T ~ 300 K =^ M ~ 1O~^M0 yr~^. This two-order of magnitude 
difference in accretion rates is the basic physical reason why the first stars are 
believed to be more massive than present-day stars. Early on, protostellar ac- 
cretion is mainly spherical, but with time, material with higher specific angular 
momentum (J/m) falls in, and results in the formation of a protostellar disk. 
The bulk of the accretion is then shifted to a disk mode. 

Disk accretion Consider a circular shell, located at radius r, within a disk of 
surface mass density S = M(iisk/(^^disk)- ^^^ such a shell, one has 

dM ~ 27rrdrS . (15) 

Dividing by dt on both sides, yields the basic expression for the disk accretion 
rate: Mace — ^nrVr'S, with Vr = dr/dt being the radial (inflow) velocity. In a 
non-viscous disk, one would have: Vr = 0. Thus the reason that there is any 
infall is the presence of viscous shear forces, or, put differently, of friction. Such 
shear forces enable the outward transport of angular momentum, and the inward 
transport of mass. 

Intuitively, to characterize viscous transport of linear momentum, one can 
set: 

rvr ~ Amfpi' ~ z^ , (16) 

with u being the (kinematic) viscosity coefficient (in units of cm^s~^). The idea 
is that viscous transport involves the displacement of a particle (or fluid ele- 
ment) from a region of higher to lower momentum, e.g., a radial excursion in a 
Keplerian disk, where frot of r~^''^. Such a displacement involves, on average, a 
distance equal to the mean-free path, Amfp, and proceeds with the average parti- 
cle velocity, v. The latter is often equal to the thermal velocity (or sound speed). 
It has long been realized that the molecular (microscopic particle) viscosity is 
much too small to have an effect on astrophysical systems, such as protostellar 
disks. Therefore, an "abnormal" source of viscosity is needed. Often, turbulent 
diffusion (transport) is implicated. Due to the inherent difficulties involved in 
the physics of turbulent transport, it is customary to employ the phenomeno- 
logical a-prescription, introduced by Shakura & Sunyaev (1973): v = aHpCg, 
where Hp is the pressure scale- height, effectively a measure of the vertical disk 
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height. The underlying intuition is that one is deahng with subsonic turbulence 
(else, for supersonic turbulence, shocks would rapidly dissipate any turbulent 
energy), such that: a < 1. Note, that within this picture, the turbulent eddy 
size is bounded by the height of the disk, and the eddy turnover velocity by the 
sound speed. Also note that here, mesoscopic fluid elements (small compared to 
the total system, but large compared to individual atoms) have taken over the 
role of the individual atoms (or molecules) as fundamental carriers of linear mo- 
mentum transport. From numerical simulations, we have the further constraint 
that, typically: a ~ 10~^ — 1, depending on the nature of the torques involved 
(gravitational, hydrodynamical, or magnetic). 

We can now re-write our expression for the disk accretion rate in a form 
that can actually be evaluated in terms of easily accessible quantities: 

Mace ^ 27rzyS . (17) 

The precise derivation would yield a "Stt" instead of our factor of 2tt. Let us 
conclude this subsection by considering some numbers, typically encountered 
in Pop III disks (e.g., Clark et al. 2011): S ~ lO^gcm'^, Hp ~ 100 AU~ 
10^^ — 10^^ cm, and Cg ~ lO^cms"^. Since here, angular momentum transport 
is dominated by gravitational torques, it is appropriate to choose a ^ 1. We 
then estimate: Ma.cc ~ 10~^ — lO~^M0yr~^. Accretion onto a massive star 
proceeds for roughly the Kelvin-Helmholtz timescale, iacc ~ ^KH ~ 10^ yr, which 
in turn is the time it takes a (massive) star to reach the hydrogen-burning main 
sequence. One then has as a rough upper limit for the final mass of a Pop III star: 
-M* ~ Macc^acc ~ IOOMq. In reality, final masses will typically be smaller, since 
accretion may be terminated earlier on due to the negative radiative feedback 
from the growing protostar (e.g., Hosokawa et al. 2011; Stacy et al. 2012). 

3.3. Initial Mass Function 

The stellar IMF is a complicated function of mass, but it is often convenient to 
simply write it as a power law, valid for a given mass range. Specifically, one 
considers the number of stars per unit mass: 

where the present-day IMF is characterized by the famous Salpeter slope of 
X = 2.35. To understand what the typical outcome of the star-formation process 
is, one can ask: Where does most of the available mass go? Or, put differently: 
What is the average stellar mass? This can be calculated as follows: 

rM^p cm,j^ 2-x MuV"^ - M/-^ 

JMio„ dM"^*^ "P low 

where in the last relation, we have used the Salpeter value for x, and Mio„ 
and Mup are the lower and upper mass limits, respectively. In general, one can 
neglect all terms involving M^p above, as long as x > 2. This means that a 
Salpeter-like IMF is dominated by the lower-mass limit. For Pop I (present-day) 
stars, one often takes Miow — O.IMq and Mup ~ IOOMq, such that M ~ O.BMq, 
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Figure 2. Zero-age main sequence (ZAMS) for very massive stars. 
Shown is a comparison between the Pop III {left line) and Pop I cases 
(right line). Stehar luminosity (in units of Lq) is plotted vs. effective 
temperature (in K). Diamond-shaped symbols: Stellar masses along the 
sequence, from IOOMq (bottom) to IOOOMq (top). As can be seen, 
the Pop III ZAMS is shifted to higher values of effective temperature, 
asymptotically reaching T^s — 10^ K. Note that current estimates for 
the Pop III mass are somewhat less extreme. Adopted from Bromm et 
al. (2001). 



whereas for Pop III, current theory postulates a characteristic (typical) mass of 
M ~ 4Miow ~ a few x WMq, assuming that the Pop III IMF were to exhibit 
a slope similar to Salpeter. The latter assumption is not at all proven, and just 
serves as a zero-order guess. 



4. First Stars: Basic Properties 

In the following, we will address the fundamentals of the structure and evolu- 
tion of massive. Pop III stars. Many of these relations are strictly valid only for 
stars with Af* > IOOMq. The stellar physics in this high-mass regime, although 
involving extreme energies and temperatures, is of an appealing simplicity. Re- 
cent simulations have revised the Pop III mass-scale downwards to more modest 
values [M^ ~ 10 — 50Mq), but the simple high-mass physics nevertheless pro- 
vides us with very useful order-of-magnitude estimates. The discussion here is 
largely based on Bromm et al. (2001), and we refer the reader to this paper, and 
references therein, for a more detailed (numerical) treatment. 

In this spirit, we start by assuming a typical Pop III stellar mass of M* ~ 
IOOMq. Massive stars are dominated by radiation pressure (see below), and 
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their structure can therefore approximately be described as a polytrope with 
index n = 3. For such a configuration, one can derive the following mass-radius 
relation: 

which is valid for a star on the hydrogen-burning main sequence. It is then 
straightforward to estimate the average mass: (p) = M^/{Att/3R^) ^ lgcm~^, 
which is of the same order as for the Sun. Somewhat surprisingly, massive 
stars, therefore do not involve extreme densities. We next wish to estimate the 
temperature, both at the surface and in the deep interior. To accomplish this, 
one has to consider the pressure inside the star. 

4.1. Radiation Pressure 

In massive stars (M* > 50Mq), radiation pressure starts to be important, com- 
pared with the usual gas (thermal) pressure. For ultra-relativistic particles, 
in our case photons, one has the general relation between pressure and energy 
density (in units of ergcm"^): Prad = l/3ui-ad- The latter is given by the Stefan- 
Boltzmann law: 

Wrad = ^-rAr^ = OradT^ , (21) 

15 h-^cr 
where h is Planck's constant. The radiation constant has the numerical value: 
Orad = 7.57 X 10^"*^^ crg K^^ cm^^. Thus, we can write the radiation (photon) 
pressure, as: 

-Prad = o = O^radT ■ (22) 

4.2. Hydrostatic Equilibrium 

The motion of any fluid element in a star is governed by the Euler equation: 

P^ = -VP + pg , (23) 

where D/Dt is the substantial (moving with the fluid) derivative, and g the 
gravitational acceleration. Assuming spherical symmetry and D/Dt = 0, we 
get the equation of hydrostatic equilibrium, which is crucial for the structure of 
stars: 

^-Prad ^ _ ^ ^24) 

dr r^ 

Here, we have assumed P = Pgas + -Prad ^ Prad, as is appropriate for massive 
stars. Above, we introduce the Lagrangian mass coordinate, m = m{r), which 
measures the total mass contained within a shell of radius r. We now re-work 
this equation in an order-of-magnitude fashion: 

—f5— - ^P^-^ ' (25) 

resulting in an estimate for the typical interior temperature in a massive Pop III 

r,= fi^^)"*~10'K, (26) 

V arad P* J 
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The next question is: How is this related to the surface temperature? In address- 
ing it, we need to consider how energy is transported in the stellar interior via 
photon diffusion. 

4.3. Radiative Diffusion 

Let us begin with a thought experiment. For the moment, (wrongly) assume 
that photons are free to escape from the hot interior of a star, which would 
imply negligible opacity (i.e., the capability of stellar gas to bottle-up radiation). 
The timescale for such hypothetical, direct escape would be: tdircct = R*/c. 
Within the same setup, we would estimate the stellar luminosity as follows: 

L = — ^^2_^ ~ ^ * ~ aradci?*^T/ . (27) 

LaT ^direct -^^ 

The radiation constant is related to the Stefan-Boltmann constant via: Orad = 
4o"sb/c, where: o"sb = 5.67 x 10~^ ergs~^ cm~^ K~^. We can thus re-phrase the 
equation above in a form similar to the standard blackbody expression: 

L = ATTRlasBTf . (28) 

Now, why is this reasoning incorrect? The answer is that in reality photons 
are trapped inside a star. Indeed, stellar material is typically extremely opaque 
to radiation, such that the photons engage in a very slow diffusion process, 
and eventually leak out from a narrow layer close to the surface, the stellar 
photosphere. Effectively, we are dealing with a near-blackbody, characterized by 
an effective (photospheric) temperature, T^q. If we further replace the (incorrect) 
direct escape timescale with the (correct) diffusion time, we have: 

^ = ^ - ^ = ^^R>SbT^, , (29) 

such that: 

T^^^^^Y'tj. (30) 

Our remaining task is therefore to estimate the diffusion timescale. 

We can do this by modeling diffusion as a (nearly-isotropic) random walk. If 
l^ is the photon mean-free path, and Nsc the number of scatterings needed for a 
photon to escape from the star, basic random- walk theory yields: i?* = \/T^l-y. 
Stellar material is extremely opaque, and one typically has: l^ ~ l/(ncjT) — 1 cm. 
Here, we have assumed that the opacity is dominated by electron (Thomson) 
scattering, with an interaction cross-section of iTt = 0.67 x 10~^^cm^. The 
diffusion timescale can then be estimated via: 

*diff - —^ - — • (31) 

Combining everything, we finally get: 

Teff ~ f -^ j Tj - IQ-^Ti ~ 10^ K . (32) 
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Massive Pop III stars are thus extremely hot (compare to the solar Teff^0 ~ 
6, OOOK), which in turn implies a number of key consequences for early cosmic 
history. Among them is a very high specific (per unit stellar mass) production 
rate of ionizing photons. It is useful to remember: iVion — 10^^ s~^Mq . In 
addition, such hot stars can also produce copious amounts of He-ionizing photons, 
including those required for the second ionization of He (54 eV). Normally, such 
extremely energetic radiation is not produced by stars, and instead originates in 
quasar (accreting black-hole) sources. 

4.4. Stellar Luminosity 

Combining the expressions above, we find for the Pop HI stellar luminosity: 



L = 4vri?^.SBTj,:.10^L,(^) . (33) 



To repeat, in deriving this, we have assumed that: (i) pressure is dominated by 
radiation (appropriate for all massive stars), and (ii) opacity is dominated by 
Thomson scattering (appropriate for metal- free stars). The L oc M* scaling is 
characteristic for very massive stars. Indeed, the luminosity for massive Pop HI 
stars is close to the theoretical upper limit, the so-called "Eddington luminosity". 
The Eddington limit can be derived by demanding that the radiation pres- 
sure associated with a given luminosity does not exceed the force of gravity. 
Specifically, let us consider the balance of forces exerted on a combination of one 
proton and one electron, where the proton provides (most of) the mass, and the 
(free) electron the opacity (via Thomson scattering): 



GmuM^ Ap^ 



(34) 



r2 ~ At ' 
where Ap^ is the absorbed photon momentum per particle: 

Ap^ AE^/c 1 L 

Thus, one has L < LedD) where the Eddington luminosity is: 

Ledd = ^^^^^M. = 1.3 X 10- ergs- (^) . (36) 

This is very close to the estimate in equation (33), demonstrating that L ~ Ledd 
for very massive (M^, > WOMq) Pop HI stars. Again, these relations still give 
reasonable ballpark numbers for less extreme masses. 

In Figure 2, the Pop HI zero-age main sequence (ZAMS) is shown, together 
with the comparison Pop I sequence, in both cases for very massive stars. As 
is evident there, the effective temperatures asymptotically approach constant 
values, Tcs ~ 10^ K for Pop HI, and ~ 60, OOOK for Pop I. Luminosities, how- 
ever, are very simlar for a star of given mass, regardless of metallicity. The 
reason that Pop HI stars are so much hotter (bluer) is their much more compact 
configuration, which is in turn a reflection of the reduced opacity in the outer 
envelope. 
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Finally, let us estimate the lifetime of massive Pop III stars. Assmning that 
such stars radiate close to the Eddington limit, and that they are almost fully 
convective, one has the simple relation: 

«.=.5:55Iil^,3xlO«yr, (37) 

-t^EDD 

where the factor of 0.007 is the efficiency of hydrogen burning. These are very 
short lifetimes, compared with the solar t^^Q r^ lO^'^ yr. The implication is that 
any feedback effects exerted on the surrounding IGM by the Pop III star are 
almost instantaneous. For example, the pristine gas in the neighborhood of the 
Pop III star is rapidly converted into metal-enriched material, such that any 
subsequent round of star formation will already lead to Pop II stars. Also note 
that for very massive stars the stellar lifetime becomes independent of mass. 



5. First Galaxy Assembly 

With the emergence of the first galaxies, we witness the onset of supersonic 
turbulence, which is expected to have important consequences for star formation 
(reviewed in Mac Low & Klessen 2004; McKee & Ostriker 2007). To see this, 
we estimate the Reynolds and Mach numbers, as follows. The Reynolds number 
measures the relative importance of inertia and viscous forces: 

inertial acceleration t^ VL 



Re = — ^^^^^.^..^.. ^ _j_ _ ^ ^ ^3j 



viscous acceleration 



where V, L, T = L/V are charcateristic velocity, length, and time scales, re- 
spectively. For the first galaxies, we can estimate: V ^ t^vir ~ 10kms~ , 
L ~ i?vir ~ Ikpc, and v ~ AmfpC^ ~ 10"*^^ cm^ s~^. For the last estimate, 
we have assumed: A^fp = l/(n(Tcoii) ~ 10^^ cm, if the number density is typi- 
cally n ~ lO^cm"^, and we consider collisions between neutral hydrogen atoms 
(o'coU ~ 10~^^ cm^). For the typical particle velocity, we assume the sound-speed 
of H2-cooled gas (c^ ~ lkms~^). The Reynolds number in the center of the 
first galaxies is therefore: Re ~ 10^, indicating a highly-turbulent situation. The 
Mach number is: Ma ~ V/cg ^ v^\^/cs ^ 10, indicating supersonic flows. 

Supersonic turbulence generates density fluctuations in the ISM. Statis- 
tically, these can be described with a log-normal probability density function 
(PDF): 



f{x)dx = — ===:exp 



' X 



(x - fixY 



dx , (39) 



where x = ln(p/p), and Hx and a^ are the mean and dispersion of the distribution, 
respectively. The latter two are connected: /Xj, = — cj^/2. This relation can easily 
be derived by interpreting the PDF above as a distribution (of x) by volume. 
One then has for the volume-averaged density: p = J pf{x)dx = p J e^ f{x)dx, 
which yields the desired result. Numerical simulations have shown that the 
dispersion of the density PDF is connected to the Mach number of the flow: 
cj^ ~ ln(l -|- 0.25Ma^). Inside the first galaxies, one finds values close to ax — 1. 
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Figure 3. Density distribution in supersonically-turbulent flows. Ini- 
tially (t = 0), the density PDF is log- normal. With time, a pronounced 
power-law tail develops towards high density. This is a reflection of the 
gas self-gravity. Adopted from Kritsuk et al. (2011). 



In Figure 3, we show an illustrative example from a numerical simulation of 
isothermal, supersonic turbulence. There, it is also evident that the self-gravity 
of the gas imprints a power-law tail toward the highest densities, on top of the 
log-normal PDF at lower densities, which is generated by purely hydrodynamical 
effects. 



6. Observational Signature 

It is useful to recall the derivations of some of the key quantities in observational 
cosmology, specifically luminosity and angular-diameter distance, as well as the 
observed flux. It is also useful to assemble estimates for their typical values as 
encountered in the first galaxies. Further details are given in the monographs 
mentioned above. In addition, a comprehensive survey of high-redshift galaxy 
observations, including a description of the key methods and tools, is given by 
Appenzeller (2009). 



6.1. Cosmological Distances 

In analogy to the usual inverse-square law, the luminosity distance, di, is defined 
via: 

^-C'obs -t^em 



/< 



obs 



AtobsAA 



obs 



iTTdl 



(40) 
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where here and in the following we refer to quantities that are measured at z = 
with the subscript "obs", and with "em" to source-frame quantities (emitted at a 
given redshift z). Note that: AE^y^g = AE'em/ll + z), and Atobs = ^icm(l + z) 
relate small differences in energy and time in the two frames. To evaluate 
A^obs) carry out the following thought experiment: Imagine that you could 
somehow 'step outside' our universe, looking down at the scene from some 
(higher-dimensional) bird's-eye perspective. Such a perspective, which of course 
is completely inaccessible in practice, would allow you to measure distances, as 
they would appear today. Or, put differently, you somehow managed to stop the 
expansion of the universe, keep everything frozen at z = 0, and go about mea- 
suring the distance, with some appropriate measuring rod, from the observer 
(the telescope) to where the source would be located today. Recall that the 
source was much closer when it emitted the photon that we receive today, but 
has hence receeded due to cosmic expansion. This source-observer distance, r{z), 
is called comoving distance, or proper distance. Although it cannot be directly 
measured, but instead can only be calculated by assuming a theoretical model of 
the universe, this concept is nevertheless extremely useful in cosmology. Assum- 
ing a point source at a given redshift z, we can now write the (proper) size of the 
spherical surface over which the photons have been spread, as: AAobs = 47rr^(z). 
To calculate the comoving distance, consider the Robertson- Walker (RW) 
metric: 

ds^ = c^dt^ - a^dr^ , (41) 

where we have assumed a spatially flat universe, and where ds is the (invariant) 
space-time interval. Since they travel along null geodesies (ds = 0), one has 
for photons: dr = cdt/a (recall a = 1/(1 + z)). The RW metric describes any 
homogeneous, isotropic, and expanding universe. To fully specify the background 
cosmological model, we also need the Friedmann equation, governing d. The 
latter can in turn be derived from the Einstein field equations of general relativity 
(see Mo et al. 2010), yielding: 



^j = H{z) = Ho^Qmil + zf + nA , (42) 

where Hq, Q^, and Q\ are the Hubble constant, the density parameter for 
matter, and that for dark energy, respectively, as measured to very high precision 
by WMAP (Komatsu et al. 2011). In this expression, we again assume a spatially 
flat universe (zero curvature). We can now carry out the integration along the 
photon-geodesic: 

JQ dz' Ho Jo ^Om(l + z') + Qa 

where we use the Friedmann equation in the last step. 

We have now all the ingredients in hand to find an expression for the lumi- 
nosity distance: 

f,= I ^^^"^ = hiR (AA) 

•'°^' 47r(l + z)2r2 Atem 47rr2(l + z)2 • ^ ^ 
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Comparing with the definition above, we finally have: dLiz) = (1 + z)r{z). It is 
useful to memorize the ballpark number for a source at z ~ 10, appropriate for 
the first galaxies: dii^z) ~ 10^ Gpc. 

Next, let us derive the analogous expression for angular- diameter distance, 
dA, where we again start with a definition that follows basic, geometrical intu- 
ition. If a source at z, having a true (proper) transverse size of D, is observed 
to have an apparent angular size of Q, we define: @ = D/dA- And let us again 
assume our bird's-eye perspective, as before. How would the source appear at 
the present-day {z = 0), if it had just been coasting along with the expand- 
ing universe since the time that the photons, reaching us now, were originally 
emitted? The situation can be described with a virtual triangle, where: 

Djl + z) D 

giving us our result: dA = r{z)/{l-\-z). Note that the two fundamental distances 
of observational cosmology are connected: d^ = c?a(1 + z)'^, such that it suffices 
to remember only one. For a first galaxy, where D ~ 1 kpc, one finds: B ~ 
D/dA ~ Ikpc/lGpc '^ 10~^ ^ 0.2". The near-IR camera on-board the JWST 
(NIRCam) should thus be able to marginally resolve these sources. 

6.2. Observed Fluxes 

To estimate how bright a first galaxy is likely to be, we need to consider the 
observed specific fiux (fiux per unit frequency): 

AtobsA^obsAz/obs 47rr2AtcmAt'cm 47rd| 

To arrive at a zeroth-order guess, we assume that the total stellar mass involved 
in the starburst at the center of a first galaxy is: M* ~ 10^ Mq. If we further 
assume that we are dealing with a top-heavy Pop III burst, the stellar radiation 
will be characterized by T^g ~ 10^ K, corresponding to a peak frquency of fmax ~ 
10"*^^ Hz, and a total luminosity close to the Eddington-luminosity (see Section 4): 
L ~ Ledd ~ 10^^ erg s^"*^. The emitted specific luminosity is thus: Lu^m ~ 
-^EDo/^'max ~ 10^^ erg s^"*^ Hz^"*^. The observed (specific) fiux for a first galaxy 
at 2: ~ 10 is then: 

fu,ohs ~ 10-'' erg s-^ cm-2 Hz"! = 1 nJy . (47) 

The nJy is indeed the typical brightness level that the JWST is designed to 
image with NIRCam, thus reiterating the point that with this next-generation 
facility, we will get the first galaxies within reach of deep-field exposures. 



7. Outlook 

The next decade will be very exciting as we are opening up multiple windows into 
the cosmic dark ages. We will finally be able to close the remaining gap in the 
long quest to reconstruct the entire history of the universe, which began with the 
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pioneers of cosmology in the 1920s. There will be many opportunities to make 
important discoveries, e.g., in direcly detecting the first sources of light, and in 
working out a well-tested theorectical framework for star and galaxy formation 
at the dawn of time. Very likely, serendipity will play a crucial role. It is thus a 
good idea to equip oneself with a comprehensive set of tools, such as the basic 
physics covered in these lecture notes. After all, there is wisdom to the old adage 
that "fortune favors the prepared mind". 
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